What is claimed is: 

1. A method for classifying signals comprising:

dividing an input signal into blocks having a predetermined time length; 

extracting one or more than one characteristic quantities of a signal 
attribute from the signal of each block; and

classifying the signal of each block into a category according to the
characteristic quantities thereof. 
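The three steps of claim 1 can be pictured with a minimal sketch. Everything named here is an assumption made for illustration only: the function names, the use of mean signal power as the single characteristic quantity, the two categories and the threshold value are not taken from the claims.

```python
def split_into_blocks(samples, block_len):
    """Divide the signal into consecutive blocks of block_len samples.

    A trailing partial block is dropped (an assumption of this sketch).
    """
    return [samples[i:i + block_len]
            for i in range(0, len(samples) - block_len + 1, block_len)]


def block_power(block):
    """Characteristic quantity used in this sketch: mean signal power."""
    return sum(x * x for x in block) / len(block)


def classify_block(block, silence_threshold=1e-4):
    """Hypothetical two-category rule: 'silence' below the threshold."""
    return "silence" if block_power(block) < silence_threshold else "sound"


def classify_signal(samples, block_len, silence_threshold=1e-4):
    """Blocking, feature extraction and classification, one label per block."""
    return [classify_block(b, silence_threshold)
            for b in split_into_blocks(samples, block_len)]
```

For example, an 11-sample signal with a block length of 4 yields two blocks, and the three left-over samples are ignored.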

2. The method for classifying signals according to claim 1, wherein said 
signal of each block is classified into any of the categories formed on the basis 
of types of signal sources.

3. The method for classifying signals according to claim 1, wherein said
signal of each block is classified into any of the categories formed on the basis
of types of structures that signals may have and do not depend on the types of
signal sources.

4. The method for classifying signals according to claim 2, wherein
said input signal is an audio signal; and

the categories formed on the basis of signal sources for classifying the
audio signal of each block include one or more than one of silence, voice, male
voice, female voice, music, vocal music, instrumental music, noise, striking
sound, environmental sound, sound of hustle and bustle, clapping sound and
cheering sound and are used for categorical classification based on the sound
sources.

5. The method for classifying signals according to claim 3, wherein 
said input signal is an audio signal; and

the categories formed on the basis of structures that signals may have 
and do not depend on the types of signal sources for classifying the audio 
signal of each block include one or more than one of a silence structure where 
no significant sound exists in the block, a single sound source structure where 
only a sound related to a single sound source exists in the block, a double 
sound source structure where sounds related respectively to two sound sources 
exist in the block, a sound source change structure where a sound source 
including silence is switched only for once in the block, a multiple sound 
source change structure where a plurality of sound sources are switched
simultaneously in the block, a sound source partial change structure where part
of a plurality of sound sources are switched in the block and an extra structure
pattern where none of the above patterns is applicable and are used for
categorical classification based on the structures.

6. The method for classifying signals according to claim 1, wherein one
or more than one of the average and variances of the signal power in the block,
the average and variances of the power of a band-pass signal of the signal in
the block, the average and variances of the spread of the spectrogram of the
signal in the block, the average and variances of the pitch frequency of the
signal in the block, the average and variances of the degree of harmonic
structurization of the signal in the block, the average and variances of the
residue signal of linear predictive analysis of the signal in the block and the
average and variances of the pitch gain of the residue signal of linear predictive
analysis of the signal in the block are used as said characteristic quantities.
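As a rough illustration of one of the listed characteristic quantities, the average and variance of the signal power in a block might be computed as sketched below. The split of a block into fixed-length frames, the frame length and all function names are assumptions of this sketch; the claims do not say how the quantities are sampled over time.

```python
def frame_powers(block, frame_len):
    """Mean power of each consecutive frame of frame_len samples in the block."""
    return [sum(x * x for x in block[i:i + frame_len]) / frame_len
            for i in range(0, len(block) - frame_len + 1, frame_len)]


def mean_and_variance(values):
    """Average and (population) variance of a list of measurements."""
    m = sum(values) / len(values)
    v = sum((x - m) ** 2 for x in values) / len(values)
    return m, v


def power_features(block, frame_len):
    """Average and variance of the frame power over one block."""
    return mean_and_variance(frame_powers(block, frame_len))
```

The same `mean_and_variance` helper could be applied to any other per-frame measurement, such as a pitch frequency track, under the same assumption.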

7. The method for classifying signals according to claim 6, wherein
said average of the degree of harmonic structurization is the temporal
average of the ratio of the energy of the sound component of integer times of
the pitch frequency to the energy of all the frequencies; and

said variances of the degree of harmonic structurization is the temporal
standard deviation of the ratio of the energy of the sound component of integer
times of the pitch frequency to the energy of all the frequencies.

8. The method for classifying signals according to claim 1, wherein a
vector quantization technique is used as method for the categorical
classification.
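One common way to use vector quantization for classification is a nearest-codeword lookup, sketched here. The codebook layout (a list of category and codeword pairs), the squared Euclidean distance and the example values are assumptions for illustration; the claim does not specify how the codebook is built or searched.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def vq_classify(feature_vector, codebook):
    """Return the category of the codeword closest to feature_vector.

    codebook is a list of (category, codeword) pairs (hypothetical format).
    """
    category, _ = min(
        codebook,
        key=lambda entry: squared_distance(feature_vector, entry[1]),
    )
    return category


# Hypothetical two-dimensional codebook.
EXAMPLE_CODEBOOK = [
    ("silence", (0.0, 0.0)),
    ("voice", (0.5, 0.2)),
    ("music", (0.9, 0.8)),
]
```

With this codebook, `vq_classify((0.45, 0.25), EXAMPLE_CODEBOOK)` selects the "voice" codeword because it is the nearest of the three.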

9. An apparatus for classifying signals comprising:

a blocking means for dividing an input signal into blocks having a
predetermined time length;

a feature extracting means for extracting one or more than one
characteristic quantities of a signal attribute from the signal of each block; and

a categorical classifying means for classifying the signal of each block
into a category according to the characteristic quantities thereof.

10. The apparatus for classifying signals according to claim 9, wherein
said categorical classifying means classifies said signal of each block into any
of the categories formed on the basis of types of signal sources.

11. The apparatus for classifying signals according to claim 9, wherein
said categorical classifying means classifies said signal of each block into any
of the categories formed on the basis of types of structures that signals may
have and do not depend on the types of signal sources.

12. The apparatus for classifying signals according to claim 10, wherein
said input signal is an audio signal; and

the categories formed on the basis of signal sources for classifying the
audio signal of each block include one or more than one of silence, voice, male
voice, female voice, music, vocal music, instrumental music, noise, striking
sound, environmental sound, sound of hustle and bustle, clapping sound and
cheering sound and are used for categorical classification based on the sound
sources.

13. The apparatus for classifying signals according to claim 11, wherein
said input signal is an audio signal; and

the categories formed on the basis of structures that signals may have
and do not depend on the types of signal sources for classifying the audio
signal of each block include one or more than one of a silence structure where
no significant sound exists in the block, a single sound source structure where
only a sound related to a single sound source exists in the block, a double
sound source structure where sounds related respectively to two sound sources
exist in the block, a sound source change structure where a sound source
including silence is switched only for once in the block, a multiple sound
source change structure where a plurality of sound sources are switched
simultaneously in the block, a sound source partial change structure where part
of a plurality of sound sources are switched in the block and an extra structure
pattern where none of the above patterns is applicable and are used for
categorical classification based on the structures.

14. The apparatus for classifying signals according to claim 9, wherein
said feature extracting means uses one or more than one of the average and
variances of the signal power in the block, the average and variances of the
power of a band-pass signal of the signal in the block, the average and
variances of the spread of the spectrogram of the signal in the block, the
average and variances of the pitch frequency of the signal in the block, the
average and variances of the degree of harmonic structurization of the signal
in the block, the average and variances of the residue signal of linear predictive
analysis of the signal in the block and the average and variances of the pitch
gain of the residue signal of linear predictive analysis of the signal in the block
as said characteristic quantities.

15. The apparatus for classifying signals according to claim 14, wherein
said average of the degree of harmonic structurization is the temporal
average of the ratio of the energy of the sound component of integer times of
the pitch frequency to the energy of all the frequencies; and

said variances of the degree of harmonic structurization is the temporal
standard deviation of the ratio of the energy of the sound component of integer
times of the pitch frequency to the energy of all the frequencies.
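The ratio defined above can be sketched for a single analysis frame and then averaged over time. This sketch assumes that a power spectrum and a pitch estimate are already available for each frame and that the pitch falls exactly on a spectrum bin; both are simplifications, and the names `harmonic_ratio` and `harmonic_structure_stats` are hypothetical.

```python
from math import sqrt


def harmonic_ratio(power_spectrum, pitch_bin):
    """Energy at integer multiples of the pitch bin divided by total energy."""
    total = sum(power_spectrum)
    if total == 0 or pitch_bin <= 0:
        return 0.0  # no energy or no pitch: treat as no harmonic structure
    harmonic = sum(power_spectrum[k]
                   for k in range(pitch_bin, len(power_spectrum), pitch_bin))
    return harmonic / total


def harmonic_structure_stats(frames):
    """Temporal average and standard deviation of the ratio over a block.

    frames is a list of (power_spectrum, pitch_bin) pairs, one per frame.
    """
    ratios = [harmonic_ratio(spectrum, pitch) for spectrum, pitch in frames]
    mean = sum(ratios) / len(ratios)
    std = sqrt(sum((r - mean) ** 2 for r in ratios) / len(ratios))
    return mean, std
```

A frame whose energy sits entirely on the pitch harmonics gives a ratio of 1, while a flat spectrum gives a much lower ratio.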

16. The apparatus for classifying signals according to claim 9, wherein
said categorical classifying means uses a vector quantization technique as
method for the categorical classification.

17. A method for generating descriptors comprising: 
dividing an input signal into blocks having a predetermined time length; 
extracting one or more than one characteristic quantities of a signal 
attribute from the signal of each block; 

classifying the signal of each block into a category according to the 
characteristic quantities thereof; and

generating a descriptor for the signal according to the category of 
classification thereof. 
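One possible shape for such a descriptor is a list of time segments, each labelled with a category. The sketch below merges consecutive blocks that share a category into one segment; the dictionary fields `category`, `start` and `end` and the function name are assumptions, since the claims do not define a descriptor format.

```python
def generate_descriptors(categories, block_seconds):
    """Turn per-block categories into merged (category, start, end) segments.

    categories: one category label per block, in time order.
    block_seconds: the predetermined block length in seconds.
    """
    descriptors = []
    for index, category in enumerate(categories):
        start = index * block_seconds
        end = start + block_seconds
        if descriptors and descriptors[-1]["category"] == category:
            # Same category as the previous block: extend that segment.
            descriptors[-1]["end"] = end
        else:
            descriptors.append(
                {"category": category, "start": start, "end": end})
    return descriptors
```

Three one-second blocks labelled voice, voice, music would produce a voice segment from 0 to 2 seconds and a music segment from 2 to 3 seconds.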

18. The method for generating descriptors according to claim 17,
wherein said signal of each block is classified into any of the categories formed
on the basis of types of signal sources.

19. The method for generating descriptors according to claim 17,
wherein said signal of each block is classified into any of the categories formed
on the basis of types of structures that signals may have and do not depend on
the types of signal sources.

20. The method for generating descriptors according to claim 18,
wherein

said input signal is an audio signal; and

the categories formed on the basis of signal sources for classifying the
audio signal of each block include one or more than one of silence, voice, male
voice, female voice, music, vocal music, instrumental music, noise, striking
sound, environmental sound, sound of hustle and bustle, clapping sound and
cheering sound and are used for categorical classification based on the sound
sources.

21. The method for generating descriptors according to claim 19,
wherein

said input signal is an audio signal;

the categories formed on the basis of structures that signals may have
and do not depend on the types of signal sources for classifying the audio
signal of each block include one or more than one of a silence structure where
no significant sound exists in the block, a single sound source structure where
only a sound related to a single sound source exists in the block, a double
sound source structure where sounds related respectively to two sound sources
exist in the block, a sound source change structure where a sound source
including silence is switched only for once in the block, a multiple sound
source change structure where a plurality of sound sources are switched
simultaneously in the block, a sound source partial change structure where part
of a plurality of sound sources are switched in the block and an extra structure
pattern where none of the above patterns is applicable and are used for
categorical classification based on the structures; and

a descriptor is generated according to the categorical classification
based on the structures.

22. The method for generating descriptors according to claim 17,
wherein one or more than one of the average and variances of the signal power
in the block, the average and variances of the power of a band-pass signal of
the signal in the block, the average and variances of the spread of the
spectrogram of the signal in the block, the average and variances of the pitch
frequency of the signal in the block, the average and variances of the degree
of harmonic structurization of the signal in the block, the average and
variances of the residue signal of linear predictive analysis of the signal in the
block and the average and variances of the pitch gain of the residue signal of
linear predictive analysis of the signal in the block are used as said
characteristic quantities.

23. The method for generating descriptors according to claim 22,
wherein

said average of the degree of harmonic structurization is the temporal
average of the ratio of the energy of the sound component of integer times of
the pitch frequency to the energy of all the frequencies; and

said variances of the degree of harmonic structurization is the temporal
standard deviation of the ratio of the energy of the sound component of integer
times of the pitch frequency to the energy of all the frequencies.

24. The method for generating descriptors according to claim 17,
wherein a vector quantization technique is used as method for the categorical 
classification. 

25. An apparatus for generating descriptors comprising: 
a blocking means for dividing an input signal into blocks having a
predetermined time length;

a feature extracting means for extracting one or more than one 
characteristic quantities of a signal attribute from the signal of each block; 

a categorical classifying means for classifying the signal of each block 
into a category according to the characteristic quantities thereof; and 

a descriptor generating means for generating a descriptor for the signal 
according to the category of classification thereof.

26. The apparatus for generating descriptors according to claim 25,
wherein said categorical classifying means classifies said signal of each block
into any of the categories formed on the basis of types of signal sources.

27. The apparatus for generating descriptors according to claim 25, 
wherein said categorical classifying means classifies said signal of each block 
into any of the categories formed on the basis of types of structures that signals 
may have and do not depend on the types of signal sources. 



28. The apparatus for generating descriptors according to claim 26,
wherein

said input signal is an audio signal; and

the categories formed on the basis of signal sources for classifying the
audio signal of each block include one or more than one of silence, voice, male
voice, female voice, music, vocal music, instrumental music, noise, striking
sound, environmental sound, sound of hustle and bustle, clapping sound and
cheering sound and are used for categorical classification based on the sound
sources.

29. The apparatus for generating descriptors according to claim 27,
wherein

said input signal is an audio signal;

the categories formed on the basis of structures that signals may have
and do not depend on the types of signal sources for classifying the audio
signal of each block include one or more than one of a silence structure where
no significant sound exists in the block, a single sound source structure where
only a sound related to a single sound source exists in the block, a double
sound source structure where sounds related respectively to two sound sources
exist in the block, a sound source change structure where a sound source
including silence is switched only for once in the block, a multiple sound
source change structure where a plurality of sound sources are switched
simultaneously in the block, a sound source partial change structure where part
of a plurality of sound sources are switched in the block and an extra structure
pattern where none of the above patterns is applicable and are used for
categorical classification based on the structures; and

said descriptor generating means generates a descriptor according to the
categorical classification based on the structures.

30. The apparatus for generating descriptors according to claim 25,
wherein said feature extracting means uses one or more than one of the average
and variances of the signal power in the block, the average and variances of the
power of a band-pass signal of the signal in the block, the average and
variances of the spread of the spectrogram of the signal in the block, the
average and variances of the pitch frequency of the signal in the block, the
average and variances of the degree of harmonic structurization of the signal
in the block, the average and variances of the residue signal of linear predictive
analysis of the signal in the block and the average and variances of the pitch
gain of the residue signal of linear predictive analysis of the signal in the block
as said characteristic quantities.

31. The apparatus for generating descriptors according to claim 30,
wherein

said average of the degree of harmonic structurization is the temporal
average of the ratio of the energy of the sound component of integer times of
the pitch frequency to the energy of all the frequencies; and

said variances of the degree of harmonic structurization is the temporal
standard deviation of the ratio of the energy of the sound component of integer
times of the pitch frequency to the energy of all the frequencies.

32. The apparatus for generating descriptors according to claim 25,
wherein said categorical classifying means uses a vector quantization technique
as method for the categorical classification.

33. A method for retrieving signals comprising: 

dividing an input signal into blocks having a predetermined time length;
extracting one or more than one characteristic quantities of a signal 
attribute from the signal of each block; 

classifying the signal of each block into a category according to the 
characteristic quantities thereof; and 

retrieving the signal according to the result of categorical classification
or by using a descriptor generated according to the result of categorical
classification. 
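Retrieval by descriptor can be pictured as a simple filter over labelled time segments. The descriptor format used here (dictionaries with `category`, `start` and `end` fields, times in seconds) and the function name are assumptions of this sketch, not something the claims define.

```python
def retrieve_segments(descriptors, wanted_category):
    """Return (start, end) pairs of all segments labelled wanted_category.

    descriptors: list of dicts with 'category', 'start' and 'end' keys
    (a hypothetical format chosen for this sketch).
    """
    return [(d["start"], d["end"])
            for d in descriptors
            if d["category"] == wanted_category]


# Hypothetical descriptors for a six-second recording.
EXAMPLE_DESCRIPTORS = [
    {"category": "voice", "start": 0, "end": 2},
    {"category": "music", "start": 2, "end": 5},
    {"category": "voice", "start": 5, "end": 6},
]
```

Querying the example descriptors for "voice" returns the two voice segments, and querying for a category that never occurs returns an empty list.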

34. The method for retrieving signals according to claim 33, wherein
said signal of each block is classified into any of the categories formed on the
basis of types of signal sources. 

35. The method for retrieving signals according to claim 33, wherein
said signal of each block is classified into any of the categories formed on the
basis of types of structures that signals may have and do not depend on the
types of signal sources.

36. The method for retrieving signals according to claim 34, wherein
said input signal is an audio signal;

the categories formed on the basis of signal sources for classifying the
audio signal of each block include one or more than one of silence, voice, male
voice, female voice, music, vocal music, instrumental music, noise, striking
sound, environmental sound, sound of hustle and bustle, clapping sound and
cheering sound and are used for categorical classification based on the sound
sources; and

a signal is retrieved by using the descriptor reflecting or corresponding
to the result of said categorical classification based on the sound sources.

37. The method for retrieving signals according to claim 35, wherein
said input signal is an audio signal;

the categories formed on the basis of structures that signals may have
and do not depend on the types of signal sources for classifying the audio
signal of each block include one or more than one of a silence structure where
no significant sound exists in the block, a single sound source structure where
only a sound related to a single sound source exists in the block, a double
sound source structure where sounds related respectively to two sound sources
exist in the block, a sound source change structure where a sound source
including silence is switched only for once in the block, a multiple sound
source change structure where a plurality of sound sources are switched
simultaneously in the block, a sound source partial change structure where part
of a plurality of sound sources are switched in the block and an extra structure
pattern where none of the above patterns is applicable and are used for
categorical classification based on the structures; and

a signal is retrieved by using the descriptor reflecting or corresponding
to the result of said categorical classification based on the structure.

38. The method for retrieving signals according to claim 33, wherein
one or more than one of the average and variances of the signal power in the
block, the average and variances of the power of a band-pass signal of the
signal in the block, the average and variances of the spread of the spectrogram
of the signal in the block, the average and variances of the pitch frequency of
the signal in the block, the average and variances of the degree of harmonic
structurization of the signal in the block, the average and variances of the
residue signal of linear predictive analysis of the signal in the block and the
average and variances of the pitch gain of the residue signal of linear predictive
analysis of the signal in the block are used as said characteristic quantities.

39. The method for retrieving signals according to claim 38, wherein
said average of the degree of harmonic structurization is the temporal
average of the ratio of the energy of the sound component of integer times of
the pitch frequency to the energy of all the frequencies; and

said variances of the degree of harmonic structurization is the temporal
standard deviation of the ratio of the energy of the sound component of integer
times of the pitch frequency to the energy of all the frequencies.

40. The method for retrieving signals according to claim 33, wherein a
vector quantization technique is used as method for the categorical 
classification. 

41. The method for retrieving signals according to claim 33, wherein 
points of changes of the signal are detected by using the descriptor reflecting 
or corresponding to the result of said categorical classification. 
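A minimal sketch of change-point detection from the classification result follows. It assumes the descriptor is simply the time-ordered list of per-block categories and reports the start time of every block whose category differs from that of the preceding block; the function name and this reading of "points of changes" are assumptions.

```python
def detect_change_points(categories, block_seconds):
    """Times (in seconds) at which the block category changes.

    categories: one category label per block, in time order.
    block_seconds: the predetermined block length in seconds.
    """
    return [i * block_seconds
            for i in range(1, len(categories))
            if categories[i] != categories[i - 1]]
```

With two-second blocks labelled silence, voice, voice, music, the detected change points are at 2 and 6 seconds. The time resolution of this approach is limited to the block length.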

42. An apparatus for retrieving signals comprising:

a blocking means for dividing an input signal into blocks having a
predetermined time length;

a feature extracting means for extracting one or more than one
characteristic quantities of a signal attribute from the signal of each block;

a categorical classifying means for classifying the signal of each block
into a category according to the characteristic quantities thereof; and 

a signal retrieving means for retrieving the signal according to the result 
of categorical classification or by using a descriptor generated according to the 
result of categorical classification. 

43. The apparatus for retrieving signals according to claim 42, wherein
said categorical classifying means classifies said signal of each block into any 
of the categories formed on the basis of types of signal sources. 

44. The apparatus for retrieving signals according to claim 42, wherein 
said categorical classifying means classifies said signal of each block into any 
of the categories formed on the basis of types of structures that signals may 
have and do not depend on the types of signal sources. 

45. The apparatus for retrieving signals according to claim 43, wherein
said input signal is an audio signal;

the categories formed on the basis of signal sources for classifying the
audio signal of each block include one or more than one of silence, voice, male
voice, female voice, music, vocal music, instrumental music, noise, striking
sound, environmental sound, sound of hustle and bustle, clapping sound and
cheering sound and are used for categorical classification based on the sound
sources; and

said signal retrieving means retrieves a signal by using the descriptor
reflecting or corresponding to the result of said categorical classification based
on the sound sources.

46. The apparatus for retrieving signals according to claim 44, wherein

said input signal is an audio signal;

the categories formed on the basis of structures that signals may have
and do not depend on the types of signal sources for classifying the audio
signal of each block include one or more than one of a silence structure where
no significant sound exists in the block, a single sound source structure where
only a sound related to a single sound source exists in the block, a double
sound source structure where sounds related respectively to two sound sources
exist in the block, a sound source change structure where a sound source
including silence is switched only for once in the block, a multiple sound
source change structure where a plurality of sound sources are switched
simultaneously in the block, a sound source partial change structure where part
of a plurality of sound sources are switched in the block and an extra structure
pattern where none of the above patterns is applicable and are used for
categorical classification based on the structures; and

said signal retrieving means retrieves a signal by using the descriptor
reflecting or corresponding to the result of said categorical classification based
on the structure.



47. The apparatus for retrieving signals according to claim 42, wherein
said feature extracting means uses one or more than one of the average and
variances of the signal power in the block, the average and variances of the
power of a band-pass signal of the signal in the block, the average and
variances of the spread of the spectrogram of the signal in the block, the
average and variances of the pitch frequency of the signal in the block, the
average and variances of the degree of harmonic structurization of the signal
in the block, the average and variances of the residue signal of linear predictive
analysis of the signal in the block and the average and variances of the pitch
gain of the residue signal of linear predictive analysis of the signal in the block
as said characteristic quantities.

48. The apparatus for retrieving signals according to claim 47, wherein
said average of the degree of harmonic structurization is the temporal
average of the ratio of the energy of the sound component of integer times of
the pitch frequency to the energy of all the frequencies; and

said variances of the degree of harmonic structurization is the temporal
standard deviation of the ratio of the energy of the sound component of integer
times of the pitch frequency to the energy of all the frequencies.

49. The apparatus for retrieving signals according to claim 42, wherein
said categorical classifying means uses a vector quantization technique as
method for the categorical classification.

50. The apparatus for retrieving signals according to claim 42, wherein
said signal retrieving means detects points of changes of the signal by using the
descriptor reflecting or corresponding to the result of said categorical
classification.